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Superlinear scaling in cities, which appears in sociological quantities such as economic produc- 
tivity and creative output relative to urban population size, has been observed but not been given 
a satisfactory theoretical explanation. Here we provide a network model for the superlinear rela- 
tionship between population size and innovation found in cities, with a reasonable range for the 
exponent. 
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I. INTRODUCTION 

It has been known for nearly a hundred years that liv- 
ing things obey scaling relationships. Max Kleiber first 
recognized that the metabolic rates of different mammals 
scale according to their masses raised to a 3/ 4-power [T] . 
More recently, Geoffrey West and his colleagues have pro- 
vided a theoretical explanation for this scaling law, as 
well as for many other allometric laws found in biology 
[5J [3] . Their theory is based upon the fractal branching 
networks (such as circulatory systems) found in all living 
things, whose function is to convey energy and nutrients 
to all parts of the organism. They argue that the larger 
the organism, the more efficient the system that can be 
constructed to provide energy, thereby yielding the ob- 
served sublinear exponent of 3/4. 

More recently. West and his team examined a vari- 
ety of properties of cities. They found that cities, which 
have long been compared to living things [H [51 [Bj , obey 
scaling relationships as well [7]. Similar to living things, 
cities have economics of scale, yielding sublinear scaling 
for such quantities as the number of gas stations within 
a city as a function of its population. In other words, 
you need fewer gas stations per person, in a bigger city. 
Examples of such scaling laws are shown in the upper 
portion of table |T] 

On the other hand, cities also exhibit superlinear scal- 
ing, which appears in relation to sociological quantities. 
As shown in the lower part of table |T] properties of cities 
related to economic productivity and creative output 
have exponents that are all found to cluster between 1 
and 1.5, with the mean around 1.2. Thus, the productiv- 



TABLE I: A variety of urban quantities and their exponents. 
For instance, if y denotes the number of gas stations in a city 
of population A'^, the data show y = cN" , with a ~ 0.77. 
Table from Bettencourt et al [?]■ 

Urban Indicators (y) Exponent (a) 



Gasoline stations 
Gasoline sales 
Length of electrical cables 
Road surface 



0.77 
0.79 
0.87 
0.83 



New patents 
Inventors 

Private R&D employment 

Supercreative employment 

R&D establishments 

R& D employment 

Total wages 

Total bank deposits 

GDP 



27 
25 
34 
15 
19 
26 
12 
08 
15 
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ity per person increases as a city gets larger. However, 
this superlinear scaling has not been given a satisfactory 
mathematical explanation. 

Here we suggest a theoretical explanation for the su- 
perlinear relationship between population size and inno- 
vation found in cities, with a reasonable range for the 
exponent. Due to the sociological nature of the variables 
being measured, it is natural to use a network model of 
a city, since it is reasonable to assume that network ef- 
fects must underlie the superlinear scaling, as West and 
his colleagues have suggested [71 [5]. For this, we draw on 
a recent class of models that derives superlinear scaling 
and densification properties from hierarchically organized 
networks |9], adapting these models to the present ques- 
tion of productivity. 
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FIG. 1: Network representation of the social structure of a 
city's inhabitants. For example, if we are determining dis- 
tance from individual A, then B is at distance 1, while G is 
at a distance of 3. 



II. MODEL AND RESULTS 

We first assume that all social interactions and rela- 
tionships are arranged in a hierarchical tree structure 
[HI [ini HJ. Picture a binary tree, or in general, a tree 
where each branch splits into b new branches. For exam- 
ple, in a city, each person is in a household, and there are 
many households on a block, and many blocks in a neigh- 
borhood, and so forth. Or the grouping could be based 
on your family tree, or corporations, or many other ways 
to group individuals. While in reality each individual 
belongs to many independent hierarchies [11', here we 
simplify it as a single hierarchy, with branching number 
6 > 2. We define the distance d between two individuals 
in this hierarchy to be the height of their lowest common 
ancestor. We view the total system as a city, meaning 
that a city of population TV represents a single tree that 
contains TV leaves. On top of the tree structure, which 
serves to determine the social distance among nodes, a 
random graph is placed showing the social connections 
— who actually knows whom. 

Our modeling strategy is to use these social connec- 
tions as the basis for a city's productivity: at a high 
level, we assume that each interaction between a pair of 
people contributes to the overall productivity, in a way 
that depends on the distance d as measured within the 
hierarchy. More concretely, our procedure for generating 
networks will produce a directed graph, and we will ac- 
count for the productivity benefits of each edge (w, w) by 
allocating it to v's overall productivity. This allocation 
to V rather than to both nodes is essentially for purposes 
of analysis, since we will be focusing in the end on the 
total population's productivity rather than any one indi- 
vidual's, and for determining this total we will see that 
it does not matter to which individual we allocate the 
benefits of the edge {v,w). 

The total creative productivity of the city is defined 
to be the sum of the productivities of each individual, 
and so we first consider how to compute individual pro- 
ductivities. To calculate the total productivity of a single 
person, three separate effects must be considered: (1) the 
probability of connecting to an individual at distance d; 
(2) the number of available people at distance d\ and (3) 



the creative output that is obtained by linking to a sin- 
gle person at distance d. Multiplying these together gives 
the productivity due to one person linking to all of his 
collaborators at distance d, as seen below: 



# contacts at d 
# people at d 



[# people at d] 



output at d 



contacts at d 



(1) 



By summing this term over all distances, the total cre- 
ative contribution of a single individual is obtained. The 
functional form of each term in the above recipe for calcu- 
lating the productivity of a single individual is discussed 
below. 

Taking the first term, the social connections between 
collaborators are constructed such that the Ukelihood of 
forming a connection at a certain social distance drops off 
exponentially fast with distance [HJIinilll]- That is, the 
probability of a connection being made between nodes 
of a social distance d (where d is the height of the first 
common internal node) is assumed proportional to b~°"^, 
where a is a tunable parameter greater than or equal to 
zero. 

It is natural that the connection probability should 
decay with social distance, but why exponentially? We 
have assumed that the social network tree is self-similar 
at all levels (values of d). Since the tree is self-similar, 
it makes sense to have the function also be self-similar 
(scale-free) with respect to the value of d, and doing this 
yields an exponential function (this assumption is relaxed 
in the next section). 

Since at each increase in d there are exponentially more 
potential contacts to interact with, we multiply the above 
function by a second term, b'^, which means that as we 
increase d, while the likelihood of making a connection 
decays, there are exponentially more contacts to make. 
To keep things simple, we suppose connections are only 
made between residents of the city (connections outside 
a city are viewed as contributing less directly to the city's 
total productivity, and are ignored). 

Lastly, the usefulness of a social connection within a 
city is assumed to vary with its social distance. For ex- 
ample, one could assume that there is a productivity ben- 
efit as social distance increases. This can be explained 
as being due to the fact that individuals that are socially 
distant are exposed to different ideas and experiences, 
and that collaboration between two more socially distant 
individuals is more productive than interaction between 
ones that are closer. However, the value of a social con- 
nection is left open, and simply assumed to be propor- 
tional to fe^"^, where /3 is a tunable parameter that can 
hold any value (even negative values, allowing the value 
of a connection to decrease with distance). An exponen- 
tial function is reasonable here as well, if we assume that 
a connection's innovation potential depends on the num- 
ber of individuals that lie between the two endpoints of 
the connection in social space. This assumption is also 
relaxed in the next section. 

The total productivity of the social connections within 
an TV-person city is now a random variable equal to the 
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sum of all the individual productivities. Its expectation 
P{N) is given by 

P{N) h-°"^b%l^'^ (2) 

d=l 

In summary, the first term, b^""^, is the probability of 
connecting at distance d. The second term, b''', is the 
number of nodes at distance d. And the final term, b^"^, 
is productivity per connection. So, when these are multi- 
plied together and then summed for each distance, they 
yield the expected productivity of one node in the full 
network. When multiplied by N we get the productivity 
for the entire network. 

This can be summed exactly since it is a finite geomet- 
ric series, and we get the following solution: 

= N (^^^^^N^-'^^^ - l) (3) 

For large values of N, we find P{N) is proportional to 
7V'^-"+2, iia <1 + p. On the other hand, ifa> 1 + P, 
the function P{N) becomes linear, because the geometric 
series converges to a constant as N becomes large: 

d=l 

This is found to be in good agreement with numerical 
evaluation of the above summation, as can be seen in 
Figure [2] 

The growth of the productivity function with distance 
is not essential. The key is, rather, that as a city increases 
in size, it is more likely to contain socially distant con- 
tacts. Having a < 1 means that the expected number of 
contacts scales superlinearly 0. Correspondingly, even 
if (3 is slightly negative (meaning that more distant con- 
nections are less productive), as long as P — a + 1 > 0, 
the densification of the social network due to the increas- 
ing size of the city means that the productivity will grow 
superlinearly. For the special case when /3 is zero (all con- 
nections are equally beneficial) , the exponent P ~ a + 2 
must lie strictly between 1 and 2, which is where all the 
measured exponents for urban innovation lie. This is 
because we are assuming that 0<a< 1 + P to get su- 
perlinear behavior, which means that a is between and 
1. 

Of course, other parameter relationships are also ca- 
pable of yielding the expected range of superlinear expo- 
nents. For P > 0, there is an exponentially decreasing 
probability of connecting to someone at a social distance 
d away, but connections at this distance confer a pro- 
ductivity benefit that is exponentially increasing in d. In 
general, this model is a reasonable explanation for the 
values observed within cities related to productivity and 
innovation, and can be fit properly to explain the super- 
linear exponents observed within cities. 
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FIG. 2: Simulation and fit for P{N). The points show the 
value of P{N), calculated for P = 0.3 and a = 1.1. The 
least-squares approximation of the exponent is 1.205, and the 
expected value is P — a + 2 — 1.2, for large values of A'^. 



III. EXPANSION OF THE ANALYSIS 

The assumptions of exponentials for the three func- 
tions that make up the sum discussed above are stringent 
ones. What happens if we relax these assumptions? 

Using a numerical simulation, each of the components 
of the sum can be modified, and we can graph the result- 
ing scaling relationship and see if it remains superlinear. 
And in fact, the model is robust under a variety of sit- 
uations. For example, instead of using an exponential 
for the creative benefit function, if we use a linearithmic 
function (dlnd), the resulting function asymptotically 
approaches a superlinear function, as seen in Figure |3] 
A similar superlinear result can be obtained by replac- 
ing the function for the number of nodes at distance d 
with a linearithmic function and leaving the other two 
functions exponential (by doing this, we are implicitly 
changing the structure of the social distance tree, such 
that the number of nodes no longer grows exponentially 
with distance). 

In fact, even if all three functions are linear, the 
sum still grows superlinearly with N, as seen in Figure 
[4] Indeed, if the function is proportional to d^, using 
the Euler-MacLaurin summation, we find that P{N) « 
A^(logj A)^, which grows a bit faster than linearly. 

However, if the average productivity per node grows 
with A^ but only at the rate log A^, then the rate of growth 
is only slightly superlinear, mimicking a power law expo- 
nent of 1.05, as seen in Figure |5] Slightly faster than 
logarithmic growth for the summation appears to be re- 
quired for superlinear growth of P{N). 

What can be seen is that using fairly loose assump- 
tions, superlinear growth can be obtained. Notably, these 
functions need not be power laws. They can simply be 
superhnear functions (such as P{N) « A^(log5 A^)"^), that 
mimic power laws. It is possible that this could be true 
of the observed city productivity data as well — that 
is, it is possible that the productivity functions observed 
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FIG. 3: A 'linearithmic' function. Using the function 
b~"'^b'''{d\nd) as the term within the sum for P{N) (where 
a = 0.6), the resulting function mimics a power law, with an 
exponent of 1.48. 
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FIG. 4: Linear Functions. Using the function (50 — ad){l + 
/3d) (2d) as the term within the sum for P{N) (where a — 0.4 
and /3 = 0.1), the resulting function still mimics a power law, 
with an exponent of 1.13. 
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FIG. 5: The function P(N) = Nlogi^N mimics a power law, 
with an exponent of 1.05. 



are super linear, but not necessarily power laws. Further 
measurement could help resolve this question. 



IV. DISCUSSION 

Ultimately, the heart of the model is the relation- 
ship between long-distance ties and productivity in large 
cities. These long-distance ties, which are prevalent in 
a higher proportion when there is a larger population, 
provide the potential for productive social interactions. 

Granovetter's classic paper 'The Strength of Weak 
Ties' considers this explicitly [T^. As part of his study, 
Granovetter examined the structure of Boston's West 
End and its inability to organize against a neighbor- 
hood urban renewal project, which included the large- 
scale destruction of buildings to make room for new res- 
idential high-rises [T3]. While the West End contained 
many strong ties, since most individuals had been life- 
long residents of the area, these strong ties often resulted 
in cliques, where everyone was connected within a sin- 
gle group. Crucially, however, Granovetter argues that 
there were few, if any, ties between these tightly-knit lo- 
cal cliques. Since personal ties are generally necessary 
for information spread and organizational ability (or as 
Granovetter put it, "people rarely act on mass-media in- 
formation unless it is also transmitted through personal 
ties," [H], p. 1374), the inhabitants of the West End 
would have had a great deal of difficulty in organizing 
their opposition to the municipal project. In contrast, 
if there had been interaction throughout the social hi- 
erarchy, such as between communities within the neigh- 
borhood, the outcome might have been much different. 
Along these lines, Charlestown, a similar Boston neigh- 
borhood, was able to successfully organize against urban 
renewal. Granovetter argues that there was a rich inter- 
connection between different communities, allowing for 
wider coordination. Alexander has similarly argued that 
rich interconnectivity between communities creates bet- 
ter cities [13]. 

Of course, any good model must be testable in order for 
it to rise above the level of a pleasant story. Additional 
work by the first author |15j has indirectly attempted to 
determine what the value of P is (it seems to be close 
to zero). Beyond this, given a network of social inter- 
action for a city, its hierarchical social structure could 
be determined [16], to see if it conforms to the type of 
growth with distance that is discussed above. This has 
not yet been done, but it should be feasible, given the 
relevant datasets. Furthermore, it would be interesting 
to consider models as well as empirical data that consider 
interactions at scales larger than within a single city, such 
as between cities or within entire geographical regions. 

In addition though, there are other possible explana- 
tions for this superlinear scaling in cities. For example, 
it could be that larger cities have a larger proportion 
of more highly educated individuals, which is enough to 
yield increased productivity per capita. Or it could be 
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that larger cities simply have a greater transient popu- 
lation, which provides more fodder for different ways of 
thinking about the world, yielding a higher rate of pro- 
ductivity per individual. By distinguishing between our 
model based on social interaction and other competing 
models, we can get a better sense of how good our model 
is. But how can this be done? 

While cities do exhibit superlinear scaling for a variety 
of quantities, many cities do not lie exactly on the pre- 
dicted curve for a given property, based on a curve of best 
fit. For example, some cities will produce more patents 
than expected, while others will produce far fewer than 
expected, given their population. By looking at the pat- 
tern of social interaction in the underperforming cities as 
compared to the overperforming cities, we can determine 
how reasonable our model is. And by examining how 
well other models can predict this type of variation, as 



opposed to ours, we can determine what is the likeliest 
explanation for superlinear scaling within cities. 

Nonetheless, as argued above, the presence of socially 
distant ties within a single city can be a powerful force. 
By using simple assumptions about social interactions, 
we gain a useful tool in understanding the mathematical 
behavior of innovation and productivity in cities. 
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